Closed-Loop Dynamic Control of a Soft Manipulator Using Deep Reinforcement Learning
نویسندگان
چکیده
The focus of the research community in soft robotic field has been on developing innovative materials, but design control strategies applicable to these platforms is still an open challenge. This due their highly nonlinear dynamics which difficult model and degree stochasticity they often incorporate. Data-driven controllers based neural networks have recently explored as a viable solution be employed for manipulators. letter presents network-based closed-loop controller, trained by deep reinforcement learning algorithm called Trust Region Policy Optimization (TRPO). training takes place simulation, using approximation robot forward dynamic obtained with Long-short Term Memory (LSTM) network. controller allows following different paths executed velocities workspace robot. results demonstrate that effective normal working conditions payload attached end-effector manipulator.
منابع مشابه
Closed-loop manipulator control using quaternion feedback
Some operational details of the zero reference positionmethod are presented in the context of deriving kinematical equations fora robot with a nonspherical continuous roll wrist.
متن کاملClosed-Loop Object Recognition Using Reinforcement Learning
Current computer vision systems whose basic methodology is open-loop or filter type typically use image segmentation followed by object recognition algorithms. These systems are not robust for most real-world applications. In contrast, the system presented here achieves robust performance by using reinforcement learning to induce a mapping from input images to corresponding segmentation paramet...
متن کاملReinforcement Learning for Mixed Open-loop and Closed-loop Control
Closed-loop control relies on sensory feedback that is usually assumed to be free . But if sensing incurs a cost, it may be costeffective to take sequences of actions in open-loop mode. We describe a reinforcement learning algorithm that learns to combine open-loop and closed-loop control when sensing incurs a cost. Although we assume reliable sensors, use of open-loop control means that action...
متن کاملDynamic and Closed-Loop Control
3 Classical closed-loop control 8 3.1 PID feedback . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 3.2 Transfer functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 3.3 Closed-loop stability . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 3.4 Gain and phase margins and robustness . . . . . . . . . . . . . . . . 12 3.5 Sensitivity function and fundamental...
متن کاملReinforcement Learning for Multi - Linked Manipulator Control
We present an automatic trajectory planning and obstacle avoidance method for a multi-linked manipulator which uses position and velocity sensor information directly to produce the appropriate real-valued torques for each joint. The inputs are fed into a Cerebellar Model Arithmetic Computer (CMAC) [1] and in each state, the expected reward and torques for each joint are learnt through self-expe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE robotics and automation letters
سال: 2022
ISSN: ['2377-3766']
DOI: https://doi.org/10.1109/lra.2022.3146903